A Novel Stroke Width Based Binarization Method to Handle Closely Spaced Thick Characters

Authors

  • P. Pavan Kumar
  • Atul Negi
  • B. L. Deekshatulu
  • Chakravarthy Bhagvati
  • Arun Agarwal
  • Rafael C. Gonzalez
  • Richard E. Woods
  • Jitendra Malik
  • Liying Fan
  • Chew Lim Tan
  • Lixin Fan
  • Xiangyun Ye
  • Mohamed Cheriet
  • Ching Y. Suen
Abstract

Signboards and billboards pose a challenge to image segmentation methods, since these images may contain pictures and graphical objects in addition to text. Methods that often succeed in more traditional text block segmentation do not perform well here, since estimation of text lines, character widths, etc. fails due to the small sample sizes. Further, extraction of characters of different font sizes, which occur in real-world and signboard images, remains a problem. In this paper, as a solution to this problem, we propose two stroke width based binarization approaches. These approaches can be used to eliminate extraneous objects based upon estimates of stroke width. We compare our methods with several other stroke width based binarization methods. We observe that the previous approaches fail when there are closely spaced thick characters. We show that our second approach is able to extract closely spaced thick characters better than

Similar resources

Shape based local thresholding for binarization of document images

This paper presents a novel local threshold algorithm for the binarization of document images. Stroke width of handwritten and printed characters in documents is utilized as the shape feature. As a result, in addition to the intensity analysis, the proposed algorithm introduces the stroke width as shape information into local thresholding. Experimental results for both synthetic and practical d...

Stroke Width-Based Contrast Feature for Document Image Binarization

Automatic segmentation of foreground text from the background in degraded document images is essential for smooth reading of the document content and for machine recognition tasks. In this paper, we present a novel approach to the binarization of degraded document images. The proposed method uses a new local contrast feature extracted based on the stroke width of text. First, a pre...

A Binarization Method for Degraded Document Images with Morphological Operations

In this paper, we propose an effective binarization method for degraded document images. This method employs morphological operations throughout its algorithm to suppress uneven illumination in the background region, to detect the character location and to reconstruct text regions. Moreover, a technique for estimating stroke width of characters is introduced to remove noises in a...

Uniqueness of bilevel image degradations

Two major degradations, edge displacement and corner erosion, change the appearance of bilevel images. The displacement of an edge determines stroke width, and the erosion of a corner affects crispness. These degradations are functions of the system parameters: the point spread function (PSF) width and functional form, and the binarization threshold. Changing each of these parameters will affect a...

Evolution maps and applications

Common tasks in document analysis, such as binarization, line extraction etc., are still considered difficult for highly degraded text documents. Having reliable fundamental information regarding the characters of the document, such as the distribution of character dimensions and stroke width, can significantly improve the performance of these tasks. We introduce a novel perspective of the imag...

Publication date: 2010